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Abstract: Heavy resonances are an integral part of many extensions of the Standard Model. 
The discovery of such heavy resonances are a primary goal at the LHC and future hadron 
colliders. When a particle with TeV-scale mass decays into electroweak-scale objects, these 
objects are highly boosted and their decay products are then strongly collimated, possibly 
to an extent that they cannot be resolved in the calorimeters of the detectors any more. 
We develop taggers for electroweak-scale resonances by combining the good energy resolution 
of the hadronic calorimeter with the superior spatial resolution of the tracking detector. 
Using track-based techniques we reconstruct heavy W' and Z' bosons and constrain the 
branching ratio of the rare Higgs boson decay H —> ZA —> l + l~ jets. The taggers show 
a good momentum-independent performance up to very large boosts. Using the proposed 
techniques will allow experiments at the LHC and a future hadron collider to significantly 
extend its reach in searches for heavy resonances. 
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1 Introduction 

After the successful discovery of the Higgs boson during LHC run 1 [1, 2], the next foremost 
goal of the upcoming runs is to discover new physics, e.g. new heavy resonances. The scale 
of new physics for many anticipated extensions of the Standard Model has already been 
pushed beyond 0(1) TeV [3-6]. If heavy TeV-scale resonances decay into electroweak-scale 
particles which in turn have large branching ratios into quarks, i.e. AxeV —>• Lew —>• jets, 
these quarks are likely to be collimated in the lab frame. More precisely, the extent to which 
the decay products of the electroweak-scale resonances are collimated depends on the ratio 
between the mass of the heavy new-physics resonance and the electroweak scale. For central 
production one finds p±y ~ mx/2. As a result, for the decay products of Y their angular 
separation scales like Ai?j ets ~ 4my/mj. As electroweak-scale resonances have generically 
large branching ratios into quarks the reconstruction and detailed analysis of hadronic final 
states is at the core of the upcoming LHC program. 

In an experiment jets are reconstructed using infrared-safe jet algorithms [7-10]. Input to 
jet algorithms are so-called topo-clusters, objects constructed from long-lived particles’ energy 
deposits in the electromagnetic (e-cal) and hadronic (h-cal) calorimeter [11], The minimal 
transverse size for a cluster of hadronic calorimeter cells is 0.3 x 0.3 in (rj, </>), reached if all 
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energy after noise-subtraction is concentrated in one cell. To discriminate two jets the angular 
separation of their axes in the detector has to be at least A R = y/ (Ar/) 2 + (A< j)) 2 = 0.2. For 
most Standard Model processes at the LHC this angular resolution is sufficient to separate 
the decay products of electroweak resonances. However, when scales are vastly separated 
and either 4my/mx <C 0.2 or in general p± t x rnx the angular separation of the decay 
products can be too small to resolve them individually. While the overall energy deposit of 
highly boosted resonances can still be measured, the substructure, i.e. the energy sharing 
between the decay products, becomes oblivious and the ability to discriminate between a 
decaying resonance and QCD jets using jet substructure observables quickly deteriorates. 
Obviously, at a possible future 100 TeV proton-proton collider where larger rrix are probed 
and the rate for electroweak resonances with p±x rnx is bigger, this issue cannot be 
ignored. 

To recover reconstruction efficiency for highly boosted resonances and extend the multi¬ 
purpose experiments’ sensitivity in searches for heavy resonances to larger masses, smaller 
input objects to jet algorithms have to be used. The most popular taggers using jet substruc¬ 
ture are either based on jet-shape observables [12-15] or on calibrated subjets inside a fat 
jet [16-20], 

Following [21], we propose to use tracks instead of topo-clusters as input for resonance 
reconstruction methods 1 and we develop and refine dedicated reconstruction procedures for 
highly-boosted electroweak-scale resonances, e.g. W/Z bosons and top quarks. While we will 
focus on subjet based techniques the same approach can be used for jet shape observables 

[23]- 

The article is arranged as follows: In Section 2 we briefly describe the top-tagging al¬ 
gorithm dubbed High-pT TopTagger (HPTTopTagger) and discuss modifications regarding 
the proposal in [21]. In Section 3 we extend the concept of the HPTTopTagger to W and Z 
boson tagging and apply these taggers in heavy resonance searches at the LHC and a 100 TeV 
FCC-hh. In Section 4 we focus on a rare Higgs decay and show that in searches for very light 
resonances track-based reconstruction can be an indispensable tool in the upcoming LHC 
runs. Eventually we summarize our findings in Section 5. 

2 Tagging highly boosted top quarks 
2.1 The new default of the HPTTopTagger 

The HPTTopTagger of Ref. [21] reconstructs the three-prong substructure of hadronically 
decaying top quarks from charged tracks. These particle trajectories can be determined 
in a tracking detector to very high radial precision. More specifically, the ATLAS inner 
tracking detector achieves an angular reslution of A?y ~ 10~ 3 and A <f> ~ 0.3mrad for charged 
particles with p± = lOGeV [24], while maintaining a reconstruction efficiency of > 78% 

1 In an earlier proposal [22] the same approach for the electromagnetic calorimetry was discussed for W 

tagging. 
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for tracks of charged particles with p± > 500 MeV [25] . Such accuracy cannot be achieved 
with the hadronic calorimeter where it is not possible to separately resolve small jets with 
< 0.2. 

Tagging algorithms that rely solely on information from calorimeter towers on the other 
hand - such as e.g. the HEPTopTagger [19] - are therefore not applicable any more if the top 
daughter jets are strongly collimated. The track-based HPTTopTagger ports elements of the 
HEPTopTagger algorithm to the high-energy regime. However, to avoid combinatorial issues 
due to a high multiplicity of tracks and the introduction of artificial mass scales in background 
events, cuts are only applied on one three-subjet configuration in a large-radius fat jet. Hence 
we do not search for a top-like structure in every possible subjet combination. The bulk of 
the top identification is then achieved by comparing ratios of invariant mass combinations of 
the three subjets. For example, the ratio m 23 /mi 23 corresponds to mw/n^t hi most hadronic 
top decays, where 77123 is the invariant mass of the sub-leading and sub-sub-leading subjet in 
transverse momentum and 777,123 is the invariant mass of the top candidate. 

Due to imperfect knowledge of all energy flow in the tracking detector (only charged 
particles are reconstructed), the track momenta are scaled according to the inverse of the 
energy fraction carried by charged tracks [21, 22] 


a • = E]et 


-^tracks 


( 2 . 1 ) 


Because the energy of the (hadronic) fat jet can be calibrated to good precision in the ex¬ 
periment [26, 27], the sensitivity to fluctuations is hence ameliorated. In essence we propose 
to perform a local recalibration of track-based (sub)jets, according to the relative energy de¬ 
posit of tracker and calorimetry. Thereby we seek to benefit from the tracker’s spatial and 
the calorimeter’s improved energy resolution. 

In fact, calibrations of Cambridge-Aachen (C/A) [9] subjets are available for radius pa¬ 
rameters as small as 0.2 [27] which makes it beneficial to re-scale the charged tracks more 
locally. For boosted tops with transverse momentum at the TeV scale, typically two or all 
three top subjets can be resolved in this way and the fluctuations are expected to be reduced 
separately. In Figure 1 we show the average mutual separation between top decay products. 
Thus, among other improvements, the new new default of the HPTTopTagger applies the 
approach of Equation (2.1) to subjets individually instead of the whole fat jet. 

We find that the Monte Carlo truth partons from the decay t —> Wb -» jjb are separated 
by a characteristic R distance of ~ 200 GeV /p±,t and that the energy carried by charged tracks 
around these partons is very well localized with a much smaller radius, cf. Figure 2. Given 
these observations, we are led to abandon the mass-drop unclustering procedure which was 
inherited from the HEPTopTagger in favour of conventional (anti -kx) [10] subjets with radius 
parameter R = 100 GeV/p±j c . We label the three subjets leading in transverse momentum 
j i, j r 2 and J 3 . This procedure renders an additional filtering stage redundant. 

The inclusion of the leading gluon emissions is important to capture all top quark decay 
products, in particular when p±j mt- la [ 21 ] this was not done explicitly, which resulted in 
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Figure 1. Average smallest and largest mutual separation in A Rij = yA</>^- + Arjfj between the 
partons from the hadronic top decay t —> Wb —> jjb. 


smallest distance between partons Dtracks P_L around parton 




Figure 2. Smallest separation between two MC truth partons in a top decay, scaled with the fat 
jet transverse momentum (left) and distribution of transverse momentum carried by charged tracks 
around the truth partons (right). 


relatively low reconstructed masses. Additional soft emissions can however easily be captured 
if we allow our subjets to overlap, i.e. we recluster the fat jet j c with R = 0.8 • min A R(ji,jk) 
where i, k € {1, 2 , 3 | i / k}. As the new radius is smaller than the subjet separation, it is 
guaranteed that they are still resolved separately. The new pj_-leading subjets j i, j 2 and j '3 
form the top candidate. 

Typically tagging efficiencies will be larger for the new HPTTopTagger which is a desired 
feature, especially when the luminosity of the experiment is limited. To achieve good rejection 
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rates of background events, in addition to the so-called A-cut of mass ratios, we apply a cut 
based on the geometric size of the top candidate. The scaled spread of the reconstructed 
subjets has to satisfy 


max A R(ji,j k ) ■ p± t top candidate £ [ 250 GeV, 750 GeV] (2.2) 

where i, k £ {1,2,3 | i / A;}, in order to reject fake candidates where the big top-like mass is 
generated through soft large-angle radiation. 

We define the new default of the HPTTopTagger algorithm by the following procedure: 

1. Define a fat jet j using the C/A algorithm with R = 0.8 and p± > 800 GeV. 

2. Discard all tracks that are not associated with j or that have p± < 500 MeV. 

3. Scale the remaining track momenta as follows: Re-cluster j with the anti-/ex algorithm 
employing a small radius R = 0.2, and calculate ay = i^jet/^tracks for each subjet using 
its respective associated tracks. We multiply the momenta of the tracks by ay and 
recombine the scaled tracks to a track-based jet j c . 

4. Re-cluster j c using the anti-fey algorithm with R = 100 GeV/pj_y c . If there are fewer 
than three subjets we consider the tag to have failed. 

5. We then calculate the smallest pair-wise distance between the three leading subjets, 
r m i n = min A/?,y and re-cluster j c with a new radius R = 0.8r m j n . If the new three 
leading subjets result in an invariant mass around the top quark mass, m can didate £ 
nit =t 25 GeV, they form our top candidate. 

6. We follow the HEPTopTagger and apply the so-called A-cut of [19] on the pair-wise in¬ 
variant masses (mi 2 , 77113 , 77123 ). If in addition the top candidate satisfies p± - max A/?,,y £ 
[ 250 GeV, 750 GeV ], we consider the top tag to be successful. 

2.2 Performance 

We investigate the tagging efficiency and the reconstructed mass of the HPTTopTagger, as 
described in Section 2.1. Throughout this section, we generate top-initiated jets from the 
production of a Z', pp —» Z' —> it —>• jets, for masses in the range mz' = 3 • • • 6 TeV. Unless 
otherwise stated, the sample with mass mz' = 6 TeV is used to generate the plots. For 
the background we consider QCD jets from dijet production. Unless stated otherwise, we 
generate events with PYTHIA 8 [28] at centre-of-mass energy y/s = 14TeV. Fat jets are 
clustered using Fast Jet [29] from stable particles with pseudorapidity 1 77 I < 4.9. We apply 
the C/A jet algorithm with resolution parameter R = 0.8 and p± > 800 GeV. To assess the 
signal efficiency we match the fat jet to MC truth top quarks by requiring Ai?(j,top) < 0.6 
and select jets with | r/j | < 2.5. 
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A priory, constructing observables from tracks can result in a marked sensitivity on 
detector resolution and efficiencies. To evaluate the impact of these effects on the performance 
of the algorithm we use the Delphes [30] fast dectector simulation. Throughout parameters 
are chosen in accordance with the default ATLAS parameters provided by Delphes. 

Figure 3 shows the receiver-operater-characteristic (ROC) curve for our algorithm. By 
tuning the free parameters and cuts, background rejection R = 1 — e m i s can be maximized 
for any working point with given signal tagging efficiency e. We use TMVA [31] to find 
optimum cuts on 77123 / 772 - 123 , p± • max AR tJ , as well as the allowed mass windows around mt 
and Tfi\y . TMVA assesses the cut parameters by their respective discrimination power and 
applies cuts in this order. To further improve performance, we run the algorithm for different 
combinations of fixed and free cut parameters and combine the resulting curves such that at 
each working point, the setup with the largest background rejection is selected. To obtain a 
dropping p± distribution as expected in a real analysis, background QCD jets we obtain from 
dijet production with p_\_ > 700 GeV. p± is the transverse momentum in the rest frame of the 
hard process at generator level. 



Figure 3. ROC for scan over 77123 / 277123 , p± - max A Rij, mt, mw , A mt, A mw in various combinations 
and pre-scan cuts. For details on signal and background generation see Section 2.2. 

In Figure 4 we show the p_i_-dependent tagging efficiencies of the HPTTopTagger. Here 
we obtain background QCD jets from dijet production with binned generator-level p± in 
the range [700, 2500] GeV and p± > 2500 GeV to achieve good statistics in all bins. Over 
the whole studied p± range we find a fiat tagging efficiency and fake rate. The outlined 
modifications for the new default of the HPTTopTagger improve the tagging efficiency for 
fat jets with p± t ~ 1 — 2 TeV while maintaining a similar fake rate. For particle-level final 
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states with /j^jatjet ~ 1 TeV the HPTTopTagger has a signal efficiency of roughly 35% which 
slightly decreases to 30% at p_i_,f a tjet = 3 TeV. Including detector effects results in a flat shift 
to lower values by 5% over the entire pj^fatjet range. 


tagging efficiency (particle level) 



tagging efficiency (detector level) 



Figure 4. Tagging efficiency of the HPTTopTagger and the new default at particle level (l.h.s.) and 
detector level (Delphes) (r.h.s.). 

Figure 5 shows the reconstructed top mass after applying the HPTTopTagger in two 
different transverse momentum windows, p_i_,f a tjet £ [1000,1500] GeV (l.h.s.) and [2000, 
2500] GeV(r.h.s.). In the former, we generate background events with p± > 700 GeV and in 
the latter with p± > 1800 GeV, to obtain a dropping p± distribution as expected from the SM 
process. The top mass distribution from the signal sample shows a clear peak at the correct 
top quark mass. While the detector simulation does not affect the position and width of the 
peak significantly, it is slightly sharper and more pronounced for p_L,f a tjet £ [1000,1500] GeV 
compared to the window [2000, 2500] GeV. 

As track-based observables are not infrared safe [32] non-perturbative contributions have 
to be taken into account to obtain well-defined results. For practical purposes, in event 
generators, hadronization models including fragmentation functions are used, which in turn 
follow perturbative evolution equations to obtain an extrapolated result at energies of interest 
from LEP data. Thus, track-based observables are subject to the parametrisation of non- 
perturbative physics. 

To estimate the impact different hadronisation models can have on the performance of 
the HPTTopTagger we compare the tagging efficiency and mass reconstruction for events 
generated with either PYTHIA or HERWIG+-1- [33], see Figure 6. We find that both event 
generators result in very similar signal efficiencies, i.e. differences of 0(10)% at most. For the 
backgrounds the differences are slightly larger but still within generic uncertainties of event 
generators. 

2.3 Resonance search with highly boosted top quarks 

The discovery of heavy resonances is a prime goal at the LHC and a possible future high- 
energy proton-proton collider. In many extensions of the Standard Model heavy resonances 
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reconstructed mass (particle level) - p± (fat jet) G [1000,1500] GeV 


reconstructed mass (particle level) - p± (fat jet) G [2000,2500] GeV 




reconstructed mass (detector level) - p± (fat jet) G [1000,1500] GeV reconstructed mass (detector level) - p± (fat jet) G [2000,2500] GeV 




Figure 5. Reconstructed top quark mass with the new HPTTopTagger at particle level (upper row) 
and detector level (Delphes) (lower row) for different transverse momentum ranges pj^fatjet- 


tagging efficiency (particle level) 


reconstructed mass (detector level) - p± (fat jet) G [1000,1500] GeV 




Figure 6. Comparison of results for the new HPTTopTagger from samples generated with PYTHIA 
(black curves) and HERWIG++ (red curves). Tagging efficiencies and reconstructed mass are shown 
at detector level for both top-initiated and QCD jets. 


are predicted, leading to highly boosted top quarks. While for top quarks with intermediate 
boost focusing on (semi)leptonic top decays can lead to stronger exclusion limits [34], at 
very high transverse momentum standard lepton-isolation requirements fail and the improved 
background rejection in leptonic final states ceases to compensate for the smaller branching 
ratio. Thus, we will focus on hadronic top decays only. 
















































2.3.1 Results for LHC 


Figure 7 shows expected event rates at the LHC with 14TeV centre-of-mass energy and an in¬ 
tegrated luminosity of 300 fb _1 . We generate three signal benchmark scenarios with PYTHIA, 
choosing mz' = (3,4,5) TeV with production cross-sections uz> = (3.45,0.501,0.00962) fb 
and resonance widths Yz> = (95,128,160) GeV respectively. We generate QCD dijet back¬ 
ground events with p± > 700 GeV and find a total cross-section of 111.6 pb. 

To reconstruct the Z’ resonances we require at least two fat jets clustered using the C/A 
jet algorithm with R = 0.8. Input to the jet finder are all final state particles with \r]\ < 4.9, 
processed through Delphes. We further impose p_L,f a tjet > 800 GeV. 


Z' mass reconstructed with the HPTTopTagger (y/s = 14 TeV) 



Z' mass reconstructed with the HPTTopTagger (y/s = 100 TeV) 



m i2 [GeV] 


Figure 7. Invariant mass of reconstructed top quarks after top tag to 12 . The left figure corresponds 
to benchmark scenarios for yfs = 14 TeV and the right figure to yfs = 100 TeV. 

If both fat jets are tagged we reconstruct the resonance by summing the reconstructed 
top quarks’ four-momenta m\ 2 = (ptx + Pt 2 ) 2 , see Figure 7. While the resonance peak is 
clearly visible, it is shifted to values below the true resonance mass. In addition, for all 
resonance masses we find a long tail towards smaller mi 2 - The shift of the mass peak and 
the low-mass tail are more pronounced for heavier resonances. This is a known effect when 
reconstructing very heavy resonances. Top taggers aim to reconstruct top quarks that are 
close to being on-shell. However, top quarks produced in decays of heavy resonances have a 
large probability to radiate gluons before decaying. As these gluons are emitted in a wide 
angle around the top quark this radiation is lost when recombining the four-momenta of the 
reconstructed tops and their invariant mass is shifted towards smaller values. 

We evaluate the required integrated luminosity to exclude the three benchmark reso¬ 
nances using a simple cut and count method. Based on the invariant mass distribution of 
the two reconstructed top quarks mi 2 we choose the two bins with the best S/B ratio and 
require S/y/B ~ 2. 

For our benchmark scenarios, with only 300 fb -1 of data, a heavy mz’ = 3 TeV can be 
excluded at 90% CL. 

Based on Table 1 in Figure 7 we show the necessary integrated luminosity for exclusion 
at 90% CL, depending on the resonance’s production cross-section. 
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Resonance 

mi 2 window (TeV) 

S/B 

s/Vb 

cr (fb) 

a for S/ \[B = 2 

m' z = 3 TeV 

2.6-3.0 

0.27 

2.90 

3.45 

2.38 

m' z = 4 TeV 

3.4-4.0 

0.13 

0.69 

0.501 

1.45 

m' z = 5 TeV 

4.4-5.0 

0.07 

0.16 

0.00962 

0.12 


Table 1 . Results for search for Z' —► ti at the LHC14 in three benchmark scenarios. The last column 
shows the required production cross-section to achieve S/y/B = 2 with 300 fb. All numbers are 
based on the results provided in Figure 7. 

Exclusion limits for Z' at the LHC 14 Exclusion limits for Z' at the FCC-hhlOO 




Figure 8. Exclusion limits at 90% CL for Z' —> ti with the new HPTTopTagger depending on 
production cross-section and integrated luminosity. The left figure corresponds to benchmark scenarios 
for ^fs = 14 TeV and the right figure to yG = 100 TeV. 

2.3.2 Results for FCC-hh 100 TeV 

We generate events as described in Section 2.3.1 but with a centre-of-mass energy of \/s = 
100 TeV. We employ the same analysis to reconstruct the heavy Z' from two tagged tops, 
but now select fat jets with pyyatjet > 2000 GeV. 

Again, we generate three signal benchmark scenarios with PYTHIA, choosing mz 1 = 
(6,10,15) TeV with production cross-sections oz> = (26.9,0.986, 0.328) fb and resonance 
widths T Z ' = (192,322,485) GeV respectively. We generate QCD dijet background events 
with p_i > 2000 GeV and find a total cross-section of 149.3 pb. 

Expected event rates at 300fb _1 are shown in Figure 7 (r.h.s.) for all three benchmark 
masses and the QCD background. The mass bins that give the largest significance are listed in 
Table 2. Based on this table, we show 90% CL exclusion limits as a function of the production 
cross-section and integrated luminosity in Figure 8 (r.h.s.). 
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3 Tagging highly boosted gauge bosons 

3.1 The HPTWTagger and HPTZTagger algorithms 


Not only boosted top quarks are of interest for searches of new physics at the LHC and future 
colliders. Many extensions of the Standard Model predict heavy resonances decaying to W 
or Z bosons [35-37]. In addition, SM cross-sections for the emission of electroweak gauge 
bosons off quarks are strongly enhanced in energetic final states [38-40]. Thus, in this section 
we outline a proposal for a track-based tagger for highly boosted W and Z bosons. 

In the hadronic decay of boosted electroweak gauge bosons, —> qq' or Z —> qq, 

the two daughter partons exhibit a typical separation A R ~ ‘^wiwizlvi.yviz- A successful 
tagging algorithms has to resolve these subjets separately, where the finite resolution of the 
hadronic calorimeter becomes the bottleneck for large transverse momenta p± > 500 GeV. 
Following the arguments above, it is straightforward to adapt the HPTTopTagger to W and 
Z boson decays. Due to the two-prong nature of the process, we replace the cut on mass 
ratios (“A-cut”) by a cut on the momentum fraction of the leading subjet, 


fp± — 


P±,h 


P.L,W/Z candidate 


P-Lji 

P-L,jl+j2 


(3.1) 


Fake candidates from QCD jets can generate a IT- or Z-like mass via soft emissions and their 
f p± is thus dominated by values close to 1. 


We propose the following algorithm to reconstruct highly-boosted W and Z bosons: 

1. Define a fat jet j using the C/A algorithm with R = 0.5 and p± > 500 GeV. 

2. Discard all tracks that are not associated with j or that have p± < 500 MeV. 

3. Scale the remaining track momenta as follows: Re-cluster j with the anti-fey algorithm 
employing a small radius R = 0.2, and calculate ay = £j e t/-^tracks for each subjet using 
its respective associated tracks. The momenta of those tracks are multiplied by ay. 
Combine the scaled tracks associated with the fat jet to a track-based jet j c . 

4. Re-cluster j c using the anti-fen algorithm with R = 80 GeV /p±.j, • If there are fewer 
than two sub jets we consider the tag to have failed. 


Resonance 

mi 2 window (TeV) 

S/B 

s/VB 

cr(fb) 

a for S/\fB = 2 

m' z = 6 TeV 

5.0-6.0 

0.098 

3.29 

26.9 

16.39 

m' z = 10 TeV 

8 .0-10.0 

0.054 

099 

3.18 

6.45 

m' z = 15 TeV 

13.0-15.0 

0.052 

0.33 

0.466 

2.84 


Table 2. Results for search for Z' —> ti at the FCC-hhlOO in three benchmark scenarios. The last 
column shows the required production cross-section to achieve S/y/B = 2 with 300 fb” 1 . All numbers 
are based on the results provided in Figure 7. 
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5. Calculate the distance between the two leading subjets, r = A Rij and re-cluster j c with 
a new radius R = 0.8 r. If the combination of the new two leading subjets gives an 
invariant mass around the EW boson mass, m can didate £ m w/z A 10 GeV, they form our 
boson candidate. 

6 . If the momentum fraction of the two candidate subjets as defined in Equation (3.1) 
satisfies f p± < 0.85 (for W candidates) or f p± < 0.80 (for Z candidates), we consider 
the EW boson tag successful. 

If higher tagging efficiencies are required, we suggest to modify steps 5 and 6 and allow 
a larger candidate mass window, m can didate £ mw/z A 15 GeV, together with a looser cut on 
the subjet momentum fraction, f p± < 0.85. This setup is denoted “working point 2” (w.p. 2) 
below. 

We are not concerned with discriminating between W and Z candidates. While the mass 
peaks are fairly well separated (cf. Figure 10, Figure 13), one may also consider additional 
distinguishing features such as (sub)jet charge [41]. 

3.2 Performance 

We investigate the tagging efficiency and the reconstructed mass of the HPTWTagger as well 
as the HPTZTagger, as described in Section 3.1. 

The relevant signal fat jets for both taggers are taken from the production of a hypothet¬ 
ical charged heavy boson at the LHC, decaying into a W and a Z boson, 

pp -> W ,zt -> W ± Z . (3.2) 

We generate events with masses in the range mw = 3 • • • 6 TeV. We consider two semileptonic 
decay patterns, pp —> W !± W±Z —» (jj)(l + l~) to assess the HPTWTagger, and pp —> 
W'± —» W±Z —» for the HPTZTagger, respectively. Background events with a 

generator-level cut p± = 400 GeV • • • 2500 GeV we generate from pp —> Zj —> {l + l~)j for the 
HPTWTagger. The background process to assess the HPTZTagger is pp —> W±j —> 
with the same generator-level cuts. We generate all events with PYTHIA 8 [ 28 ] at centre-of- 
mass energy i/s = 14 TeV. Fat jets are clustered from stable particles with pseudorapidity 
|? 7 | < 4.9 using the C/A jet algorithm with resolution parameter R = 0.5 and p± > 500 GeV. 
To assess the signal efficiency we match the fat jet to MC truth W / Z bosons by requiring 
AR(j, W/Z) < 0.4 and select jets with \r]j\ < 2 . 5 . Additionally, we require the fat jet to be 
isolated from the MC truth leptons from the other gauge boson. For a leptonically decaying 
Z boson, the condition is A R(j, lepton) > 0 . 6 . The isolation criterion for a leptonic W boson 
decay reads AR(j, lepton/neutrino) > 0 . 6 . 

Figure 9 shows the efficiencies for tagging highly boosted W bosons with the HPTWTag¬ 
ger and background rnistag rates for both working points suggested in Section 3.1. Results 
are presented at particle level (left) and detector level after running Delphes (right). We use 
all event samples as listed above to generate the plots in order to achieve good statistics over 
the whole p± range. 
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We show the reconstructed W boson mass in Figure 10 in two different p± bins, also 
both at particle and detector level. In the p± E [1000,1500] GeV bin (left), we only consider 
the m\y' = 3 TeV sample for signal events. In order to obtain a dropping p± distribution as 
expected from the SM process, we generate background events with fixed generator-level cut 
p_ l > 1000 GeV . At higher transverse momenta in the p± E [2000,2500] GeV bin (right), we 
simulate signal events with mw' = 5 TeV and for the background we impose a generator-level 
p_ l > 2000 GeV. 


tagging efficiency (particle level) 


tagging efficiency (detector level) 




Figure 9. Tagging efficiency of the HPTWTagger at particle level (l.h.s.) and detector level (Delphes) 
(r.h.s.) for boosted W bosons (solid lines). Standard Model QCD background mistag rates are given 
by the dashed lines. Results are shown for the standard setup (black) and the higher-efficiency working 
point 2 (red). 


In Figure 11 we compare the tagging efficiency of our HPTWTagger to a BDRS-like 
algorithm [42], The latter sequentially unclusters the fat jet, discarding the less heavy subjet 
at each step until a mass drop is found, 


maxrn 


rrij 


^ < 0.67 , 


min(p^ j.)AR 


hi 'a nirnpij. 


> 0.09. 


(3.3) 


Vj nij maxp_L,ii 

If no mass drop is found, the candidate is discarded. The two subjets are then filtered with 
a radius 


-^filtering = max (0.2, min(0.3, AR jlj2 /2)) (3.4) 

and the three hardest fitered subjets are summed to form the W boson candidate. In ac¬ 
cordance with out algorithm, we consider the tag successful if the invariant mass lies within 
mw ±10 GeV. 

For the performance of the BDRS algorithm as implemented according to Equations (3.3) 
and (3.4) quickly deteriorates for increasing jet p±, see Figure 11. For p±j > 1500 GeV it is 
more likely to obtain a W-tag by with a QCD jet than with a W decay. While optimising 
the parameters entering Equation (3.3) might help to recover efficiency, the minimal angular 
separation of calibrated filtered subjets ARjij2 > 0.2 eventually limits the applicability of 
this tagger for highly boosted resonances. 
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Figure 10. Reconstructed W boson mass with the HPTWTagger at particle level (upper row) and 
detector level (Delphes) (lower row) for different pj^fatjet- 
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Figure 11 . Comparison of the tagging efficiencies of the HPTWTagger (black lines) and a BDRS-like 
tagger (red lines) at detector level (Delphes) for boosted W bosons. Standard Model QCD background 
mistag rates are also given. 


We show the boosted Z boson tagging efficiencies of the HPTZTagger in Figure 12 for 
both working points and both at particle level (left) and detector level (right). Again, tag- 
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ging efficiency and background rejection are stable over a large range of transverse momentum 
from p_i = 500 GeV up to 3 TeV. Throughout the remainder of this section, the cuts imposed 
at generator level, i.e. signal m-w 1 and p± for the background, are the same as in the corre¬ 
sponding plots of the HPTWTagger. 

Figure 13 depicts the reconstructed mass of the boosted Z boson. Both in the highly 
boosted (left) and very highly boosted regimes (right), a peak at the MC truth mass is found 
over a flat background mass distribution. We consider signal events with rn\y = 3 TeV 
(5 TeV) and background events with p > 1000 GeV (2000 GeV) in the p± € [1000,1500] GeV 
([2000,2500] GeV) bins to approximate realistic p± distributions. No degredation is found in 
the reconstructed mass when going to high fat jet transverse momentum. 


tagging efficiency (particle level) 



fat jet p± 


tagging efficiency (detector level) 



fat jet p± 


Figure 12. Tagging efficiency of the HPTZTagger at particle level (l.h.s.) and detector level (Delphes) 
(r.h.s.) for boosted Z bosons (solid lines). Standard Model QCD background mistag rates are given by 
the dashed lines. Results are shown for the standard setup (black) and the higher-efficiency working 
point 2 (red). 

3.3 Resonance search with highly boosted gauge bosons 

Production of a charged heavy boson, pp -» W'± —> W±Z, as discussed in Section 3.2 is 
ideally suited for resonance searches at the LHC. We investigate W' masses in the range 
mw = 3 • • • 6 TeV. Again we consider two semileptonic scenarios, depending on whether the 
W boson or the Z boson decays hadronically. In either case, a fat jet is formed with the C/A 
algorithm with parameters R = 0.5 and p± > 800 GeV. The other gauge boson we force to 
decay leptonically and we assume perfect reconstruction of the visible electrons and muons. 

In the first scenario pp —> IV /=t —> W±Z —> (), we apply the HPTWTagger to tag 
and reconstruct the W boson. Identification of the decay of the Z boson into a pair of charged 
leptons (e + e _ , p. + p~) reduces the relevant Standard Model backgrounds to pp — > Zj. Here 
we assume perfect reconstruction of the leptons from the Z boson decay. A fat jet we reject 
if close to any of the leptons from the Z decay, AR(j, lepton) < 0.6. We generate background 
events in bins of generator-level p± ranging from 700 GeV to 2.5 TeV, and p± > 2.5 TeV. 

The heavy W' resonance is reconstructed as the vectorial sum of the tagged W and 
Z four-momenta. Its invariant mass we plot in Figure 14 (l.h.s.), showing expected event 
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Figure 13. Reconstructed Z boson mass with the HPTZTagger at particle level (upper row) and 
detector level (Delphes) (lower row) for different pj^fatjet- 

rates for an integrated luminosity of 300 fb _1 . For all benchmark masses, the signal peaks 
outnumber the SM background in the respective mass bins. 

The second scenario is given by a leptonically decaying W boson, pp —> W r± — > W±Z —>■ 
(l^v) ( jj) and we apply the HPTZTagger to the hadronic decay of the Z boson. The relevant 
Standard Model background is pp —> W ± j in the highly-boosted regime, again generated in 
generator-level p± bins in the range [700 GeV, 2.5 TeV] and p± > 2.5 TeV. 

To recover the four-momentum of the W boson, we assume perfect reconstruction of the 
charged lepton and obtain the four-momentum of the invisible neutrino as follows. Standard 
jets (C/A, R = 0.4, p± > 30 GeV) are used to determine the missing transverse momen¬ 
tum. Jets in the vicinity of the charged lepton are discarded if A Rji < 0.5. The neutrino 
transverse momentum is then given by the negative of the summed jet transverse momentum. 
Imposing zero invariant mass of the neutrino and fixing the W mass (pi +p u ) 2 = rn|y, we can 
determine the remaining components of the neutrino momentum. Of the two solutions of the 
quadratic equation for p z , we choose the one with smaller A Ri v . Due to imperfect knowledge 
of transverse momentum, there may be no real solution. In this case we simply take the real 
part as p z . 

Candidate fat jets for Z boson tagging are rejected if not well-separated from the decay 
products of the W boson, Ai?(j, lepton/neutrino) < 0.6. We then obtain the tagged mass 
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Figure 14. Invariant mass of a heavy boson W ,=t —> W ± Z for three different resonance masses 
mw' = 3TeV (red), mw' = 4TeV (blue), and rriy/' = 5TeV (green). In the left panel the hadronically 
decaying W boson is reconstructed with the HPTWTagger, while perfect reconstruction is assumed 
for the leptonically decaying Z boson. In the right panel the Z boson decays hadronically and is 
reconstructed with the HPTZTagger, while the leptonically decaying W boson is reconstructed as 
described in the text. The relevant Standard Model backgrounds consist of Z+ jets (left) and W +jets 
(right) respectively, and are depicted as black dashed lines. Event rates are given for pp collisions at 
yfs = 14 TeY and an integrated luminosity of 300 fb -1 . 


Resonance 

mi 2 window (TeV) 

S/B 

s/Vb 

cr(fb) 

a for S/yfB = 2 

rniy> = 3 TeV 

2.9-3.1 

10.1 

11.9 

0.139 

0.0233 

mw' = 4 TeV 

3.9-4.1 

15.8 

5.1 

0.0192 

0.0075 

mw 1 = 5 TeV 

4.9-5.1 

17.3 

1.6 

0.00322 

0.0040 


Table 3. Expected event rates for a heavy W' reconstructed with the HPTWTagger at 300 fb 1 


Resonance 

7?ii2 window (TeV) 

S/B 

s/Vb 

u(fb) 

a for S/^B = 2 

771 iy/ = 3 TeV 

2.9-3.1 

3.80 

10.6 

0.455 

0.086 

mw 1 = 4 TeV 

3.9-4.1 

2.78 

2.92 

0.0631 

0.043 

mw' = 5 TeV 

4.9-5.1 

0.087 

0.029 

0.0108 

0.743 


Table 4. Expected event rates for a heavy W' reconstructed with the HPTZTagger at 300 fb 1 . 

of the heavy W' resonance from the reconstructed W and Z bosons, m^ v , = (pw + Pz) 2 , 
see Figure 14 (r.h.s.). 

As was done in Section 2.3, we evaluate the required integrated luminosity to exclude 
the three benchmark resonances using a cut and count analysis. We select the two bins 
with largest significance (S/y/B) for the reconstructed W' with the HPTWTagger in Table 3. 
Using the HPTZTagger on the second scenario, we find less significant results, see Table 4. 

Based on the results of the pp —> W :± —> W^Z —> process, we give the 

required integrated luminosity to exclude a heavy W' resonance at the LHC depending on 
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the production cross-section in Figure 15. 


Exclusion limits for W' at the LHC14 



C [fb- 1 ] 


Figure 15. Exclusion limits at 90% CL for the process pp —> W f± W ± Z at yfs = 14TeV. The 
hadronically decaying W boson is tagged and reconstructed with HPTWTagger. 


4 Tagging highly boosted scalars at the LHC 

After the recent discovery of a Higgs boson, i.e. the first electroweak-scale scalar resonance, 
the scalar sector has become the centre of the focus of both multi-purpose experiments and the 
theory community. Whether the Higgs boson is part of a minimal or non-minimal Higgs sector 
remains to be determined. While many popular models predict several scalar resonances, 
e.g. 2HDM or NHDM, their masses are a priori largely undetermined. 

To give an example for the benefit of using a track-based tagger for scalar resonances at 
the LHC with y/s = 14 TeV, we will focus on the rare prompt decay of the Higgs boson into 
a Z boson and a cp-odd light scalar H —> ZA. The mass of the cp-odd scalar is assumed 
to be rriA = 2 GeV. Hence, A is likely to be highly boosted as mu /2 3> rri a • We assume A 
to decay into gluons exclusively. To cope with large backgrounds we study the Higgs boson 
produced in association with a leptonically decaying Z boson. 

The signal we generate using PYTHIA and for the background process ZZj we use 
SHERPA [43]. Including decays of the Z bosons to electrons or muons and imposing p±j > 25 
GeV, we find a LO cross-section of <Jzzj = 31.69 fb. 

We require exactly four leptons (electrons or muons) with p±i > 10 GeV. To pair the 
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(4.1) 


four leptons to two Z bosons we minimize 


2 _ ~ m z ) , (m Wm - m z f 

XZZ ~ - To-r 


A 2 

^z 


A 2 


where % 7 ^ j ^ n ^ m. and A z = 5 GeV. We then require both lepton pairs to be in a mass 
window of mz ± 5 GeV individually. We then remove the leptons from the final state objects 
and use the remaining objects in \g\ < 5 to cluster C/A jets with R = 0.4 and p±j > 30 GeV. 
Next we need to identify the jet containing the decay products of the cp-odd scalar. Thus, 
we select this jet by minimizing 


Xjz = min(| m jtZl ~ m H \, \m jt z 2 ~ m H \) (4.2) 

and veto events where Xjz, min > 15 GeV. We find that this requirement selects the jet with 
smallest distance to the cp-odd scalar efficiently, see Figure 16. For the signal jet we expect a 
very narrow pencil-like substructure with a high localised energy density, similar to hadronic 
r jets. To further remove background contamination we require at least two charged tracks 
associated with the jet and eventually use all charged tracks as constituents to construct j c . 
We veto events if mj c > 3 GeV. 



Figure 16. Angular distance between the selected jet and the true cp-odd scalar A. 

Using the reconstruction efficiencies e in Table 5, with a^z = 883.0 fb and assuming 
BR(A —> gg) = 1 we can set a limit BR (H —> ZA ) < 0.001 for 100 fb -1 . 

5 Summary and outlook 

The reconstruction of heavy resonances is of great importance for the success of the upcoming 
LHC runs and possible future colliders. When heavy resonances decay into electroweak-scale 
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2 Z bosons 

fat jet 

Xjz < 15 GeV 

^tracks > 1 and m jc < 3 GeV 

(H -> ZA)Z 

0.513 

0.296 

0.221 

0.131 

ZZj 

0.497 

1.61 • 10“ 3 

4.84 • 10 -4 

5.43 • 10~ 5 


Table 5. Reconstruction efficiencies e for analysis steps described in Section 4. 

objects, these objects are highly boosted. The subsequent decay products are then very 
much collimated and inevitably merge to a single jet, unable to be separately resolved by the 
detector’s calorimeter. For the ATLAS and CMS detectors at the LHC, this angular resolution 
scale corresponds to heavy resonances with mass > 2 TeV when the decay products are around 
the electroweak scale. 

We developed dedicated tagging algorithms for the hadronic three-prong decay of the 
top quark (HPTTopTagger) as well as for the hadronic two-prong decay of W and Z bosons 
(dubbed HPTWTagger and HPTZTagger). Our algorithms show stable efficiencies and kine¬ 
matic reconstruction up until boosts of several TeV. The taggers combine the good energy 
resolution of the calorimeter with the very fine spatial resolution of the tracker. As only 
charged particles leave a signal in the tracking detector, this partial information alone is in¬ 
sufficient. We apply optimized substructure techniques on highly boosted fat jets and find 
good discrimination power against QCD-initiated jets, which form the dominant background. 

Track-based tagging algorithms significantly increase the discovery reach of heavy reso¬ 
nances into the multi-TeV regime. We showed that already in the early stage of run II at 
the LHC, a heavy Z' gauge boson decaying into a pair of top quarks can be excluded up 
until mz' = 3 TeV, resulting in a much stricter bound than current exclusion limits. A future 
proton-proton collider with centre-of-mass energy \fs = 100 TeV is capable of probing heavy 
resonances of multiples of 0(10) TeV. Applying the HPTWTagger and HPTZTagger to a 
search for a heavy charged boson W', we find an exclusion reach up to m\y> = 4 TeV in the 
early stage of LHC run II. Probing such large resonance masses is not possible with current 
tagging algorithms based solely on jets built with calorimeter information. 

We also showed that the general arguments on spatial resolution do not apply exclusively 
to very heavy resonances by constraining the branching ratio of the 125 GeV Higgs boson into 
a Z boson and a very light cp-odd scalar. The essential quantity that determines whether or 
not charged tracks have to be taken into the picture is the mass ratio of the heavy resonance 
X and the intermediate resonance Y that ultimately decays into the observed jets. Whenever 
mx/iTiy ~ 20, reconstruction methods relying on calorimeter-based jets break down and 
track-based observables start to become indispensable. 
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